An information-theoretic analysis of Grover's 

algorithm 



Erdal Arikan 
Electrical-Electronics Engineering Department, 
Bilkent University, 06533 Ankara, Turkey. 
arikan@ee.bilkent.edu.tr 



O 



q i February 1, 2008 



> 
00 

O 



Abstract 



Grover discovered a quantum algorithm for identifying a target element in an un- 
structured search universe of N items in approximately n/A^/N queries to a quantum 
' oracle. For classical search using a classical oracle, the search complexity is of order 

N/2 queries since on average half of the items must be searched. In work preceding 
Grover's, Bennett et al. had shown that no quantum algorithm can solve the search 
problem in fewer than 0(VN~) queries. Thus, Grover's algorithm has optimal order of 
complexity. Here, we present an information-theoretic analysis of Grover's algorithm 
Qh! and show that the square-root speed-up by Grover's algorithm is the best possible by 

any algorithm using the same quantum oracle. 
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H ! 1 Introduction 



Grover [Q] discovered a quantum algorithm for identifying a target element in an un- 
structured search universe of N items in approximately ir/A^/N queries to a quantum oracle. 
For classical search using a classical oracle, the search complexity is clearly of order N/2 
queries since on average half of the items must be searched. It has been proven that this 
square-root speed-up is the best attainable performance gain by any quantum algorithm. 
In work preceding Grover's, Bennett et al. j|] had shown that no quantum algorithm can 
solve the search problem in fewer than 0{s/N) queries. Following Grover's work, Boyer 
et al. Q showed that Grover's algorithm is optimal asymptotically, and that square-root 
speed-up cannot be improved even if one allows, e.g., a 50% probability of error. Zalka 
H strengthened these results to show that Grover's algorithm is optimal exactly (not only 
asymptotically). In this correspondence we present an information-theoretic analysis of 
Grover's algorithm and show the optimality of Grover's algorithm from a different point of 
view. 
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2 A general framework for quantum search 



We consider the following general framework for quantum search algorithms. We let X 
denote the state of the target and Y the output of the search algorithm. We assume that 
X is uniformly distributed over the integers through N — 1. Y is also a random variable 
distributed over the same set of integers. The event Y = X signifies that the algorithm 
correctly identifies the target. The probability of error for the algorithm is defined as 
P e = P(Y^X). 

The state of the target is given by the density matrix 

AT-l 

PT= £(1/JV)|*><*|, (1) 
x=0 

where {|x)} is an orthonormal set. We assume that this state is accessible to the search 
algorithm only through calls to an oracle whose exact specification will be given later. The 
algorithm output Y is obtained by a measurement performed on the state of the quantum 
computer at the end of the algorithm. We shall denote the state of the computer at time 
k = 0, 1, . . . by the density matrix pc{k). We assume that the computation begins at time 
with the state of the computer given by an initial state pc(0) independent of the target 
state. The computer state evolves to a state of the form 

N-i 

PC (k) = j2(yN)p x (k) (2) 

x=0 

at time k, under the control of the algorithm. Here, p x (k) is the state of the computer 
at time k, conditional on the target value being x. The joint state of the target and the 
computer at time k is given by 

N-i 

PTc(k) = Yl (W)\x)(x\ ® Px (k). (3) 

x=0 

The target state @) and the computer state (||) can be obtained as partial traces of this 
joint state. 

We assume that the search algorithm consists of the application of a sequence of unitary 
operators on the joint state. Each operator takes one time unit to complete. The compu- 
tation starts at time and terminates at a predetermined time K, when a measurement is 
taken on pc(K) and Y is obtained. In accordance with these assumptions, we shall assume 
that the time index k is an integer in the range to K, unless otherwise specified. 

There are two types of unitary operators that may be applied to the joint state by a search 
algorithm: oracle and non-oracle. A non-oracle operator is of the form I ®U and acts on 
the joint state as 

p TC (k + l) = {I®U)p TC (k)(I®Uy = J2(VN)\x){x\®U Px (k)tf. (4) 

x 

Under such an operation the computer state is transformed as 

pc(k + l) = U Pc (k)Ul (5) 
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Thus, non-oracle operators act on the conditional states p x {k) uniformly; p x (k + 1) = 
U p x (k)U^ . Only oracle operators have the capability of acting on conditional states non- 
uniformly. 

An oracle operator is of the form ^2 X \x){x\ <g> O x and takes the joint state pTc(k) to 

PTc(k + 1) = £(l/JV)|s)(x| ® O xPx (k)Ol (6) 

X 

The action on the computer state is 

p c (k + l) = Y,(yN)O xPx (k)Ol (7) 



All operators, involving an oracle or not, preserve the entropy of the joint state pTc(k). 
The von Neumann entropy of the joint state remains fixed at S[pTc(k)] = log N throughout 
the algorithm. Non-oracle operators preserve also the entropy of the computer state; the 
action (||) is reversible, hence S[pc(k + 1)] = S[pc(k)]. Oracle action on the computer state 
(0), however, does not preserve entropy; S[pc(k + 1)] ^ S[po(k)], in general. 

Progress towards identifying the target is made only by oracle calls that have the capability 
of transferring information from the target state to the computer state. We illustrate this 
information transfer in the next section. 



3 G rover's algorithm 

Grover's algorithm can be described within the above framework as follows. The initial 
state of the quantum computer is set to 

pc(0) = \s)(s\ (8) 

where 

7V-1 

\a)= ^(l/VAQIx). (9) 

x=0 

Since the initial state is pure, the conditional states p x (k) will also be pure for all k > 1. 

Grover's algorithm uses two operators: an oracle operator with 

O x = I-2\x)(x\, (10) 
and a non-oracle operator (called 'inversion about the mean') given by / <g> U s where 

U s = 2\s){s\-I. (11) 

Both operators are Hermitian. 
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Grover's algorithm interlaces oracle calls with inversion-about-the-mean operations. So, it 
is convenient to combine these two operations in a single operation, called Grover iteration, 
by defining G x = U s O x . The Grover iteration takes the joint state prc(k) to 

p TC (k + 1) = J2(l/N)\x){x\ ® G xPx (k)Gl (12) 

X 

In writing this, we assumed, for notational simplicity, that G x takes one time unit to 
complete, although it consists of the succession of two unit-time operators. 



Grover's algorithm consists of K = {ir/4)^/N successive applications of Grover's iteration 
beginning with the initial state (@), followed by a measurement on pc{K) to obtain Y. The 
algorithm works because the operator G x can be interpreted as a rotation of the x—s plane 
by an angle 6 = arccos(l — 2/N) « 1j\[N radians. So, in K iterations, the initial vector 
\s), which is almost orthogonal to \x), is brought into alignment with \x). 

Grover's algorithm lends itself to exact calculation of the eigenvalues of pc(k), hence to 
computation of its entropy. The eigenvalues of pc(k) are 

Ai(fc) = cos 2 {9k) (13) 

of multiplicity 1, and 

A.W = ^ (14) 

of multiplicity N — 1. The entropy of pc{k) is given by 

S{ Pc {k)) = -\ l {k)\og\ 1 {k)-{N -l)\ 2 (k)\og\ 2 (k) (15) 

and is plotted in Fig. |l| for N = 2 20 . (Throughout the paper, the unit of entropy is bits and 
log denotes base 2 logarithm.) The entropy S(p c (k)) has period tt/9 « (tt/2)\/~N. 

Our main result is the following lower bound on time-complexity. 



Proposition 1 Any quantum search algorithm that uses the oracle calls {O x } as defined 
by (1C) must call the oracle at least 

K>( ] -^ + ^—)VN (16) 



2-7T 7T log iV 

times to achieve a probability of error P e . 



For the proof we first derive an information-theoretic inequality. For any quantum search 
algorithm of the type described in section 0, we have by Fano's inequality, 

H(Y\X) < H(P e ) + P e \og(N - 1) < H(P e ) + P e log(A0, (17) 

where for any < u < 1 

H(u) = -5 log 5 - (1 - S) log(l - S). (18) 
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Figure 1: Evolution of entropy in Grover's algorithm. 



On the other hand, 

H{X\Y) = H(X)-I(X;Y) 
= log N - I(X;Y) 

> logN-S( Pc (K)) (19) 
where in the last line we used Holevo's bound || p. 531]. 

Let /ifc be the largest eigenvalue (sup-norm) of pc(k)- We observe that /ik begins at time 
with the value 1 and evolves to the final value \xk at the termination of the algorithm. We 
have 

S{pc{K)) < -fi K logpK-(l-m)^g[(l-fi K )/(N-l)] < H(/^) + (l-w01ogiV. (20) 

since the entropy is maximized, for a fixed px, by setting the remaining N — 1 eigenvalues 
equal to (1 - fM K )/(N - 1). Combining (jl|) and pi), 



PK log N < P e log N + H{P e ) + H(fi K ) < Pe log N + 2 (21) 

Now, let 

A = sup{|/i fc+1 - fj, k \ : k = 0, 1, . . . ,K - 1}. (22) 
This is the maximum change in the sup norm of pc(k) per algorithmic step. Clearly, 

K> 1 -^. 
A 
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Using the inequality pl|), we obtain 

l-Pe + 2/logiV 
K > — — . (23) 

Thus, any upper bound on A yields a lower bound on K. The proof will be completed by 
proving 

Lemma 1 A < 2tt/Vn. 

We know that operators that do not involve oracle calls do not change the eigenvalues, hence 
the sup norm, of pcik). So, we should only be interested in bounding the perturbation of 
the eigenvalues of pc(k) as a result of an oracle call. We confine our analysis to the oracle 
operator ( jlOj ) that the Grover algorithm uses. 

For purposes of this analysis, we shall consider a continuous-time representation for the 
operator O x so that we may break the action of O x into infinitesimal time steps. So, we 
define the Hamiltonian 

H x = -ir\x)(x\ (24) 

and an associated evolution operator 

O x (r) = e- iTH * = I+(e^ T -l)\x)(x\. 
The operator O x is related to O x (t) by O x = O x (l). 

We extend the definition of conditional density to continuous time by 

Px{ko + t) = O x (r)p x (k )O x (T)^ (25) 
for < t < 1. The computer state in continuous-time is defined as 

PC (t)=J2(l/N)p x (t). (26) 

X 

Let {X n (t), u n (t)}, n = 1, . . . , N, be the eigenvalues and associated normalized eigenvectors 
of pc{t). Thus, 

Pc{t)\u n {t)) = X n (t)\u n (t)} , (u n (t)\p c (t) = \ n (t)(u n (t)\, (u n (t)\u m (t)) = 5 ntm . (27) 
Since pc(t) evolves continuously, so do X n (t) and u n (t) for each n. 

Now let (X(t),u(t)) be any one of these eigenvalue-eigenvector pairs. By a general result 
from linear algebra (see, e.g., Theorem 6.9.8 of Stoer and Bulirsch p. 389] and the 
discussion on p. 391 of the same book), 

<*A(t) , dp c (t) 

= (<t)\^ r \u{t)). (28) 
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To see this, we differentiate the two sides of the identity X(t) = (u(t)\pc (t)\u(t)) , to obtain 



^ = (n'(t)| /3c (t)|n(t)) + (n(t)|^^|u(t)) + (n(t)| /3c (t)|n / (t)) 
= {u{t )\^P-\u(t)) + A(t)[(u'(t)|u(t)> + (u(t)\u'(t))] 

= (<t)\^^nt)) + mj t Ht)\u(t)) 

= (u(t)\ dj ^\u(t)) 

where the last line follows since (u(t)\u(t)) = 1. 

Differentiating (p6[), we obtain 

d Pc {t) 



dt 



J2-(i/N)[H x ,p x (t)] 



(29) 



where [•, •] is the commutation operator. Substituting this into (^q), we obtain 



d\(t) 




dt 






< 




(a) 




< 








< 



2 

N 
N 

iV' 



Y,Ht)\H xPx (t)\u(t)) 



/EM*)|fl2l«(t))jE<«(*)l^(t)l«(t)) 



vr 2 I (u(t) |x) | 2 ^V<«(t)|p c (t)Kt)> 



2tt 



2vr 



1 • JA(t) 



where (a) is the Cauchy-Schwarz inequality, (6) is due to (i) p x (t) = p x (t) as it is a pure 
state, and (ii) the definition (^). Thus, 



|A(fco + l)-A(fco)| 



fco + l 



fro 



dA(t) 
dt 



dt 



< 2tt/VN. 



(30) 



Since this bound is true for any eigenvalue, the change in the sup norm of pc{t) is also 
bounded by 2tt/Vn. 



Discussion 



The bound (16) captures the \f N complexity of Grover's search algorithm. As mentioned 
in the Introduction, lower-bounds on Grover's algorithm have been known before; and, in 



7 



fact, the present bound is not as tight as some of these earlier ones. The significance of 
the present bound is that it is largely based on information-theoretic concepts. Also worth 
noting is that the probability of error P e appears explicitly in dl6|), unlike other bounds 
known to us. 
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